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plicated in Alzheimer's disease (Fig, 3B, cy- 
toskeietal/neuronal cluster). Most of the otber 
known genes in this cluster are implicated in 
neuronal pathfmding and cell adhesion, in- 
cluding E-cadherin, which encodes a protein 
associated with the presenilin complex (28), 
and Notch, which encodes a substrate of the 
presenilin complex (29, 30). The cluster of 21 
genes is enriched for components and sub- 
strates of the presenilin complex. 

These data (24) provide an overview of 
gene expression profiles during Drosophila de- 
velopment An unusually high proportion of the 
genes are developmental^ regulated, but of 
4028 genes analyzed, only 903 are previously 
named Drosophila genes with a known mutant 
phenotype, biochemical function, or protein ho- 
mology. Fifty -one percent of the genes fall into 
50 clusters with correlation coefficients greater 
than 0.80 (for an annotated hierarchical cluster, 
see fig. S7, green bars). Virtually all the clusters 
contain genes with known or predicted roles in 
development or physiology, and genes to which 
a biochemical or cellular function has been 
assigned by the GO project (12) [all genes in 
these clusters are listed in the online database 
(24)]. A large number of the clusters contain 
genes that are used together in specific devel- 
opmental or biochemical processes. On the ba- 
sis of their developmental expression patterns, 
we have tentatively assigned 53% of the genes 
to a. developmental or biological functional cat- 
egory (for example, male germ line, female 
germ line, eye, muscle, early zygotic, biochem- 
ical complex, or cell biology function). 

In addition to providing functional an- 
notation of the Drosophila genome, these 
studies are a step toward a complete de- 
scription of the genetic networks that con- 
trol development. 
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Structural Basis for Gluten 
intolerance in Celiac Sprue 

Lu Shan, 1 0yvind Molberg, 5 Isabeile Parrot, 1 Felix Hausch, 1 
Ferda Filiz, 1 Gary M Gray, 2 Ludvig M. SolLid, 5 
Chaitan Khosla 1 ' 3 - 4 * 

Celiac Sprue, a widely prevalent autoimmune disease of the small intestine, is 
induced in genetically susceptible individuals by exposure to dietary gluten, A 
33-mer peptide was identified that has several characteristics suggesting it is 
the primary initiator of the inflammatory response to gluten fn Celiac Sprue 
patients, in vitro and in vivo studies in rats and humans demonstrated that it 
is stable toward breakdown by all gastric, pancreatic, and intestinal brush- 
border membrane proteases. The peptide reacted with tissue transglutaminase, 
the major autoantigen in Celiac Sprue, with substantially greater selectivity 
than known natural substrates of this extracellular enzyme. It was a potent 
inducer of gut-derived human T cell lines from 14 of 14 Celiac Sprue patients. 
Homologs of this peptide were found in all food grains that are toxic to Celiac 
Sprue patients but are absent from all nontoxic food grains. The peptide could 
be detoxified in in vitro and in vivo assays by exposure to a bacterial prolyl 
endopeptidase, suggesting a strategy for oral peptidase supplement therapy for 
Celiac Sprue. 



Celiac Sprue (also known as Celiac disease or 
gluten-sensitive enteropathy) is an autoim- 
mune disease of the small intestine caused by 
the ingestion of gluten proteins from widely 
prevalent food sources such as wheat, rye, 
and barley, In many human leukocyte antigen 
(HLA) DQ2 (or DQ8)-positive individuals, 
exposure of the small intestine to gluten m- 
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duces an inflammatory response, leading to de- 
struction of the villous structure of the intestine 
(7-5). It commonly appears in early childhood, 
with severe symptoms including chronic diar- 
rhea, abdominal distension, and failure to 
thrive. In many patients, symptoms may not 
develop until later in life, when the disease 
symptoms include fetigue, diarrhea, and weight 
loss due to malabsorption, anemia, and neu- 
rological symptoms. Celiac Sprue is a life- 
long disease, and if untreated it is associated 
with increased morbidity and mortality (4, 5). 
Despite its high prevalence in most popula- 
tion groups (>1 :200) and serious manifesta- 
tions, the only effective therapy is strict di- 
etary abstinence from these food grains. 
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The principal toxic components of wheat 
gluten are a family of closely related Pro- and 
Gin-rich proteins called gliadins (6, 7). How- 
ever, given the enormous biological diversity 
and unusual chemistry of gluten proteins, and 
the absence of satisfactory assays for gluten 
toxicity, the structural basis for gluten toxic- 
ity in Celiac Sprue remains unclear (2). Not- 
withstanding the heterogeneity of T cell 
epitopes in glutei^ a few epitopes appear to 
account for most of the «-gIiadin-speciflc 
recognition by CD4 + T cells from patients (#, 
9). These peptides are also substrates of tissue 
transglutaminase (tTGase) (10, 11), and the 
products of this enzymatic reaction bind to 
the HLA DQ2 molecule (12, 13), 

To identify the physiologically stable re- 
gions of gliadin., recombinant a2-gliadin, a 
representative a-gliadin (14), was digested 
with gastric and pancreatic enzymes and an- 
alyzed by liquid chromatography coupled 
with tandem mass spectroscopy and ultravi- 
olet spectroscopy (LC-MS/MS/UV) (15) 
(Fig. 1). The most noteworthy of the diges- 
tive products was a relatively large fragment, 
the 33-mer LQLQPFPQPQLPYPQPQLPYPQP- 
QLPYPQPQPF (residues 57 to 89) (16). This 
peptide was of particular interest for two 
reasons: (i) Whereas most other relatively 
stable proteolytic fragments were cleaved to 
smaller fragments when the reaction times 
were extended^ the 33-mer peptide remained 
intact despite prolonged exposure to pro- 
teases, (ii) Three distinct patient-specific T 
cell epitopes identified previously in T cell 
proliferation assays (8, 17) are present in this 
peptide, namely, PFPQPQLPY, PQPQL- 
PYPQ (three copies), and PYPQPQLPY (two 
copies). In addition to this Gin- and Pro-rich 



33-mer, the peptide WQIPEQSR was also 
identified and was used as a control in many 
of the studies that followed. Under similar 
proteolytic conditions, myoglobin (another 
common dietary protein) is rapidly broken 
down into much smaller products (18). No 
long intermediate is observed to accumulate. 

The small intestinal brush-border mem- 
brane (BBM) enzymes are vital for breaking 
down any remaining peptides from gastric or 
pancreatic digestion into amino acids, dipep- 
tides, or tripeptides (19). BBM fractions were 
prepared from rat small intestinal mucosa 
(20). The specific activities of known BBM 
peptidases were verified to be within the 
previously reported range (21). Whereas the 
half-life of disappearance of WQTPEQ3R 
was —60 min in the presence of BBM protein 
(12 ng/jxl), the half-life of LQLQPF- 
PQPQLPYPQPQLPYPQPQLPYPQPQPF 
digestion was >20 hours (18). Therefore, the 
latter peptide must remain intact throughout 
the digestive process in the stomach and 
upper small intestine and is poised to act as a 
potential antigen for T cell proliferation and 
intestinal toxicity in genetically susceptible 
individuals. 

To validate the initial findings with rat 
BBM preparations in human small intestine, 
we prepared small intestinal biopsy material 
taken as part of the care of five individuals, 
one of whom carried the diagnosis of Celiac 
Sprue and was in remission. The other four 
patients proved to have normal intestinal his- 
tology. LQLQPFPQPQLPYPQPQLPYPQP- 
QLPYPQPQPF, QLQPFPQPQLPY (an in- 
ternal sequence from the 33-mer used as a 
control), WQIPEQSR, and other control pep- 
tides (100 jxM) were incubated with BBM 



lx -gliadin composite digestion! 



Fig. 1. Products of 
gastric plus pancrea- 
tic protease mediated 
digestion of ot2-glia- 
din under physiologi- 
cal conditions. Analy- 
sis was performed by 
reverse-phase liquid 
chromatography cou- 
pled with eiectrospray 
ionization mass spec- 
troscopy (LC-ESIMS). 
The longest peptides 
are highlighted by ar- 
rows coded to indicate the sequence of «2-gliadin 
{bottom right). Although pepsin-catalyzed cleav- 
age of the NH 2 -terminal Leu residue of the red- 
coded 33-mer was observed only in prolonged 
Incubations, this residue was included in the later 
analysis because it stabilized the peptide from 
noncatalytic conversion into pyro-Gin peptide. 
(Pyroglutamination readily occurs during both 
synthesis and biochemical assays involving pep- 
tides with NH 2 -terminal Gin residues.) 




MVRVPVPQLQPQNPSQQQPQEQVPLVQ 

QQQFPGQQQPFPPQQPYPQPQPFPSQQ 
PY/LQLQPFPQPQkP YPQPQL P YPQPQL 
PYPQPQPFRPQQPYPQSQPQYSQPQQP 
I SQQQQQQQQQQQQKQGQQQQQQILQQ 
I LQQQL I PCRDWLQQHS I AYGS SQVL 
QQSTYQLVQQLCCQQLWQIPEQ SRCQA 
IHNWHAI I LHQQQQQQQQQQQQPLSQ 
VSFQQPQGQYPSGQGSFQPSQQNPQAQ 
GSVQPQQL PQFEE IRNLALETL PAMCN 
VYI P PYCTI APVGIFGTNYR 



prepared from each human biopsy (final ami- 
nopeptidase N activity -13 [xU/pJ, total pro- 
tein — 1 u,g/|xl) at 37°C for varying time 
periods. Although QLQPFPQPQLPY, WQIP- 
EQSR, and other control peptides were near- 
ly completely proteolyzed within 1 to 5 
hours, the 33-mer peptide remained largely 
intact for at least 15 hours (Table 1). 

The proteolytic resistance of the 33-mer 
gliadin peptide, observed in vitro with BBM 
from rats and humans, was confirmed in vivo 
by a perfusion protocol in intact adult rats 
(22). Whereas >90% of QLQPFPQPQLPY 
was proteolyzed in the perfusion experiment, 
the 33-mer gliadin peptide was highly resis- 
tant to digestion (Fig. 2). These results dem- 
onstrate that the 33-mer peptide is very stable 
when it is exposed to the BBM of the mam- 
malian upper small intestine, 

Regiospecific deamidation of immuno- 
genic gliadin peptides by tTGase increases 
their affinity for HLA DQ2 as well as the 
potency with which they activate patient-de- 
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Fig. 2. In vivo BBM digestion of peptides. (A) 
Reverse-phase liquid chromatography at UV 
215 nrn (RPLC-UV 215 ) trace of 25 jxM of 
LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF 
(shown by arrow) before perfusion and after 
perfusion of a 20-cm segment of upper small 
intestine (residence time - 20 min). {B) RPLC- 
UV 21S trace of 50 of QLQPFPQPQLPY 
(shown by arrow) before perfusion and after 
perfusion of the 20-cm segment {residence 
time = 20 min). *, pyroQLQPFPQPQLPY. 

Table 1. Human BBM catalyzed digestion {%) 
{over 15 hours) of LQ LQPFPQPQLPYPQPQLPYPQ- 
PQLPYPQPQPF ("33-mer"), QLQPFPQPQLPY 
{"Control A") r and WQIPEQSR {"Control B"} de- 
rived from a panel of adult biopsies. Asterisk indi- 
cates participant diagnosed with Celiac Sprue. 
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rived gluten-specific T cells 9, 14, 23). 
The specificity of tTGase for certain short 
antigenic peptides derived from gliadin is 
higher than its specificity toward its physio- 
logical target site in fibronectin (24, 25). The 
kinetics and regiospecificity of deamidation 
of the 33-mer a-gliadin peptide were there- 
fore measured. The £ cal /K M (specificity) was 
higher than that reported for any peptide stud- 
ied thns far (26). Analysis with LC-MS and 
MS revealed that, although the deamidation 
pattern of LQLQPFPQPQLPYPQPQLPYPQ- 
PQLPYPQPQPF was complex, mono-deami- 
dated products at the underlined Gin residues 
accumulated with time. This is consistent 
with the observed regioselectivity of human 
tTGase (8, 24), Because preliminary results 
indicate that tTGase activity is associated 
with the BBM of intestinal enterocytes (J8> 
27)^ it is likely that dietary intake of even 
small quantities of wheat gluten will lead to 
the build-up of sufficient quantities of this 
33-mer gliadin peptide in the intestinal lu- 
men, which will be recognized and processed 
by tTGase. 

The eenirality of this 33-mer in the patho- 
genesis of Celiac Sprue is highlighted by the 
observation that not only is the deamidated 
product an excellent substrate for tTGase, but 
it is a very potent stimulator [median effec- 
tive concentration (EC 50 ) —80 nM] of three 
different HLA DQ2-restricted T cell clones 
derived from intestinal biopsies of Celiac 
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Sprue patients stimulated with gluten (28) 
(Fig. 3). Each clone has been shown to rec- 
ognize a distinct epitope found in the 33-mer 
(PFPQPQLPY, PQPQLPYPQ, and PYPQP- 
QLPY, respectively) (8, 17). Moreover, 
freshly prepared T cell lines from most Celiac 
Sprue biopsies are stimulated by one or more 
of these epitopes, suggesting collective in> 
munodominance of these epitopes. Testing of 
randomly selected polyclonal T cell lines 
from 14 different Celiac Sprue patients (Ta- 
ble 2) revealed that each line was very sen- 
sitive to the deamidated 33-mer peptide 
(EC 50 < 1 fi,M). Therefore, we interpret the 
combination of metabolic stability and mul- 
ti valency of the 33-mer to endow it with 
exceptional toxic potency against the small 
intestinal mucosa. These findings vividly re- 
inforce the pathological importance of recent 
observations that initiation of a T cell-medi- 
ated inflammatory response requires the mul- 
tivalent engagement of T cell receptor-major 
histocompatibility complexes (MHCs) by an 
antigenic peptide (29, 30). They also suggest 
how this multivalent peptide (or its selective- 
ly deamidated form), which is intrinsically 
inert toward digestive breakdown, might be 
used orally for vaccination, prevention, and/ 
or treatment of Celiac Sprue (31, 32). 

Sequence alignment searches using 
BLASTP in all nonredundant protein databases 
revealed several homologs (E value < 0.001) of 
the 33-mer gliadin peptide. Food grain- derived 



homologs were only found in gliadms (from 
wheat), hordeins (from barley), and secalins 
(from rye), all of which are toxic cereals in the 
Celiac diet (6) (fig. SI). Nontoxic food grain 
proteins, such as avenins (in oats), rice, and 
maize, do not contain homologous sequences to 
the 33-mer gliadin. In contrast, a BLASTP 
search with the entire a2-gliadin sequence iden- 
tified food grain protein homologs from both 
toxic and nontoxic proteins. On the basis of 
available information regarding the substrate 
specificities of gastric, pancreatic, and BBM 
proteases and peptidases (19), we predict that, 
although many gluten homologs of the 33-mer 
gliadin peptide contain proteolytic sites and are 
therefore likely to be digested over time, several 
sequences from wheat, rye, and barley can be 
expected to be comparably resistant to gastric 
and intestinal proteolysis as LQLQPF- 
P QP QLP YPQPQLP YPQPQLP YPQPQPF 

The primary sequence of the 33-mer gli- 
adin peptide aiso had homologs among a few 
nongluten proteins. Among the strongest ho- 
mologs were interna! sequences from pertac- 
tin (a highly immunogenic protein from Bor- 
detella pertussis) and a mammalian protein 
tyrosine phosphatase of unknown function. In 
both cases, available information suggested 
that this homology could have biological rel- 
evance. For example, the region of pertactin 
that is homologous to the 33-mer gliadin 
peptide is known to be part of the hnmuno- 
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Fig. 3, Stimulation of three HLA DQ2-restricted T cell clones (TCC) 
derived from intestinal biopsies of Celiac Sprue patients by the 
33-mer and shorter epitopes. EC 50 indicated by arrows. Maximal 
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responses used for the estimation of EC 50 values were determined in 
independent experiments. (A) TCC 380.E2. (B) TCC 430.1.135, (C) TCC 
370 E-3.19. 



Table 2. Comparative response of human polyclonal T celt lines (coded by 
numbers) derived from intestinal biopsies of Celiac Sprue patients to LQLQPF- 
PQPQLPYPQPQLPYPQPQLPYPQPQPF (33-mer) and representative gtiadin 
epitopes defined by earlier studies. Each antigen was pretreated with tTGase 
to elicit an enhanced T cell response. Peptide 1206 (YQQLPQPQQPQQSF- 



PQQQRPF; corresponding to -y-f); Peptide 1258 (PQPQLPYPQPQLPY; corre- 
sponding to «-ll); Peptide 1305 (IIQPQQPAQ- corresponding to y-U); Peptide 
1317 (LQPQQPFPOQPQQPYPQQPQ; corresponding to 7-III + 7-V). Blank 
eel! implies no response was observed for these peptides against the corre- 
sponding T cell lines at deamidated peptide concentrations up to 10 ^M. 
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dominant segment of the protein (34). In the 
case of the phosphatase homolog, the protein 
is blown to undergo vesicular trafficking into 
the cytoplasmic Golgi (35). By analogy with 
the current understanding of how gliadin pep- 
tides are acquired by HLA-DQ2 via a 
tTGase-mediated pathway (J?), we hypothe- 
sized that these Pro-Gln-rich segments of 
both pertactin and the phosphatase are likely 
to be high-affinity tTGase substrates. To test 
this hypothesis, we synthesized the corre- 
sponding peptides and measured the selectiv- 
ity of tTGase for these. As predicted, both 
peptides were found to be good substrates of 
tTGase (36). Studies by Pastan and co-work- 
ers showed that tTGase plays a key role in 
receptor-mediated endocytosis of several bi- 
ologically important proteins (37). Therefore, 
we therefore propose that the biological ac- 
tivities of both pertactin and the phosphatase 
depend on tTGase-mediated trafficking. 
Analogous to gliadin in Celiac Sprue, pertac- 
tin elicits a vigorous antibody response 
because it reacts with tTGase on the extracel- 
lular surface of antigen presenting cells to 
produce a long-lived intermediate that is in- 
ternalized via endocytosis and presented to 
the immune system via the class II MHC- 
mediated pathway. If this analogy holds, 
then, drawing upon the Celiac Sprue analogy, 
one could predict that tTGase-mediated en- 
docytosis might be a highly effective mech- 
anism for oral vaccination with the use of 
immunogenic peptide epitopes, as long as 



they are intrinsically resistant to the action of 
pepsin, trypsin, chymotrypsm, and elastase. 
By flanking such epitopes with proteolyti- 
cally stable high-affinity tTGase substrates 
[e.g., the sequence PQPQLPYPQPQLP from 
gliadin (21)], they could be protected from 
exposure to potent pancreatic and intestinal 
exopeptidases and would therefore have suf- 
ficiently long half-lives to permit efficient 
stimulation of the gut-associated lymphoid 
system. Secondary structural studies using 
circular dichroism spectroscopy on the 33- 
mer gliadin peptide and its homologs from 
pertactin and the tyrosine phosphatase reveal 
that these peptides have strong type II 
polyproline helical character (fig. S2). In ad- 
dition to reinforcing the proteolytic resistance 
of these peptides, the type II polyproline he- 
lical conformation is typical of peptides 
bound to class II MHC proteins and is likely 
to enhance their binding to these proteins (38, 
39). 

The abundance and location of proline 
residues is a crucial factor contributing to the 
resistance of the 33-mer gliadin peptide to 
gastrointestinal breakdown. Therefore, we 
hypothesized that a prolyl endopeptidase 
could catalyze breakdown of this peptide, 
thereby diminishing its toxic effects. Prelim- 
inary in vitro studies with short gliadin pep- 
tides and the prolyl endopeptidase (PEP) 
from Flavobacterium menirtgosepticum sup- 
ported this hypothesis (21). The ability of this 
PEP to cleave the 33-mer gliadin peptide was 



LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF 



0 min 




t = 60 min 



t= 110 min 



10 



11 



12 



13 14 15 16 



17 



16 19 20 min 



B 



LQLGFFPQPQLPYPQPQLPYPQPQLPYPQPQPF 

before perfusion 



perfusion, no PEP 



perfusion with PEP 



75000 -^~P1 + BBM(10')+PEP4tTGase 
PI+BBWpO'J+PEP+tTGase 
-^-PI+BBMW+PEP+tTGase 
50000 - "*~'P1 + BBM{4hr)+PEP+rrGa» 



25000 



10 



15 



20 



25 



30 min 




0.01 0.1 1 
Concentration [uM] 



Fig. 4. Breakdown and detoxification of 33-mer gliadin peptide with PEP. (A) RPLC-UV Z15 traces of 
the 33-mer gliadin peptide {100 jxM) incubated in vitro with PEP {540 mU/ml). (B) RPLC«UV Z15 
traces of in vivo digested 33-mer gliadin peptide (25 jjlM) with and without PEP (25 mU/ml) in the 
rat small intestine (residence time — 20 mtn). (C) Stimulation of T cell clone TCC 380.E2 (specific 
for QPFPQPELPY) by 33-mer gliadin peptide after PEP (500 mU/ml) and BBM (dipeptidy! peptidase 
activity 500 mU/ml) treatment for different durations, followed by tTGase treatment The data for 
TCC 430.1.135 and TCC 370 E-3.19 were similar (not shown; see Fig. 3 caption). 



evaluated in vitro (Fig. 4A) and in vivo using 
the rat intestinal perfusion model (Fig. 4B). 
In the latter assay, the synergistic effect of 
PEP and BBM peptidases was evident. More- 
over, the T cell stimulatory potential of PEP- 
treated peptide was shown to decrease rapidly 
(Fig. 4C). Given the preference of PEP for 
Pro-Xaa-Pro tripeptides (40) and the abun- 
dance of this motif in immunogenic peptides 
from gluten (41), these results highlight the 
potential of detoxifying gluten in Celiac 
Sprue patients by peptidase therapy. 
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